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Introduction 



These lectures were delivered at the "International Conference on Subjects Related to the Clay Problems" 
held at Chonbuk National University, Chonju, Korea in July, 2002. My aim was to give mathematicians and 
graduate students unfamiliar with analytic number theory an introduction to the theory of the Riemann 
zeta-function focusing, in particular, on the distribution of its zeros. Professor Y. Yildirin of the University 
of Ankara, who also delivered a set of lectures at the conference, concentrated on the distribution of prime 
numbers. 

A few general remarks about the lectures are in order before I summarize their contents. First, since 
I could only cover a small part of the subject in the time allotcd, my choices about what to include and 
exclude were necessarily personal. Second, I have glossed over a number of technical details in order to keep 
the focus on the main ideas. Finally, there is almost nothing new in the lectures. The exception is the 
description of a new random matrix model due to C. Hughes, J. Keating, and the author at the end of the 
third lecture. I should also add that this manuscript is a very close record of the lectures I delivered and 
this, I think, accounts for the somewhat breezy style. 

In the first lecture I presented the basic background material on the zeta-function, sketched a proof of the 
Prime Number Theorem, explained how the Riemann Hypothesis (RH) comes into the picture, and briefly 
summarized the evidence for it. 

In the second lecture I wanted to explain how one studies the distribution of the zeros and chose mean- 
value estimates as a unifying theme. I described what mean-value estimates are, gave several examples, and 
explained in a general way their connection with the zeros. I then sketched the ideas behind two applications 
- the most primitive zero-density estimate (due to H. Bohr and E. Landau) and the proof of N. Levinson's 
famous result that at least one-third of the zeros of the zeta-function lie on the critical line. Both results were 
cited in Lecture I as evidence for the Riemann Hypothesis. I had also intended to present the conditional 



The work of the author was supported in part by a grant from the National Science Foundation. 

1 



2 S.M. GONEK DEPARTMENT OF MATHEMATICS UNIVERSITY OF ROCHESTER ROCHESTER, N. Y. 14627 U.S.A. 

result of J. B. Conrey, A. Ghosh, and the author that more than seventy percent of the zeros are simple, but 
there was not enough time. However, I have included that application here. 

The third lecture began with the observation that the Riemann Hypothesis does not answer all our 
questions about the primes; one also needs detailed information about the vertical distribution of the zeros 
on the critical line. I then presented H. Montgomery's pioneering work on the pair correlation of the zeros. In 
the remainder of the lecture I stated the GUE hypothesis and described the most recent work on modeling 
the zeta-function by characteristic polynomials of random matrices from the Circular Unitary Ensemble 
(CUE). 

For those wishing to study the zeta-function in more depth, the most important books are by H. Davenport 
[D], H. M. Edwards [E], A. E. Ingham [12], A. Ivic [Iv], and E. C. Titchmarsh [Tl], [T2 ]. For a background 
in random matrix theory the reader should consult M. L. Mehta [M] and P. Dcift [Df]. 

I take this opportunity to thank the organizers and the many other fine Korean mathematicians I got to 
meet for the first time at the conference. Thanks also to the mathematicians and students who so warmly 
hosted us visiting mathematicians and made the conference such an enjoyable and memorable one. 
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Lecture I 

The Zeta-Function, Prime Numbers, and the Zeros 

Although most mathematicians are aware that the prime numbers, the Ricmann zeta-function, and the 
zeros of the zeta-function are intimately connected, very few know why. In this first lecture I will outline 
the basic properties of the zeta-function, sketch a proof of the prime number theorem, and show how the 
location of the zeros of the zeta-function directly influences the distribution of the primes. I will then explain 
why the Riemann Hypothesis (RH) is important and the evidence for it. 

1 The Riemann zeta-function 
The Ricmann zeta-function is defined by the Dirichlet series 

oo 

C( S ) = 5>-*, 

n=l 

which can also be written 

cm = +p~ s +p~ 2s +•••)= ri( i -^ s ) _i - 

P P 

where s = a + it is a complex variable. We immediately see that the zeta-function is built out of the 
prime numbers. Observe that the series and product both converge absolutely in the half-plane a > 1. 
Their equality in this region may be regarded as an analytic equivalent of the Fundamental Theorem of 
Arithmetic. For the Fundamental Theorem assures us that each term n~ s in the series occurs once, and only 
once, among the terms resulting from multiplying out the Eulcr product. Conversely, if we know the equality 
of the sum and product, the Fundamental Theorem follows. From the equality of the sum and product we 
can also deduce the well kown fact that there are an infinite number of primes. For if there were not, the 
product would remain bounded as a — > 1 + , whereas we know that the sum tends to infinity. 

Since no factor in the Euler product equals zero when a > 1, we deduce that £(s) ^ when a > 1 . Also, 
since the series converge absolutely when a > 1, it converges uniformly in compact subsets there. It follows 
that £(s) is analytic in the half-plane a > 1. 

The most fundamental properties of the zeta-function are: 

(1) Analytic continuation: ((s) has an analytic continuation to C except for a simple pole at s = 1. 

(2) Functional equation: The zeta-function satisfies the functional equation 

7r- s / 2 r(s/2)C(s) - n-^- s ^ 2 T((l - a)/2)C(l - a) . 

(3) Trivial zeros: The only zeros of ((s) in o < are simple ones at s = —2, —4, —6, .... 

(4) Nontrivial zeros: ((s) has infinitely many zeros p = (3 + i"f in the "critical strip" < a < 1. 
These lie symmetrically about the "critical line" u = 1/2, and about the real axis. 
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(5) Density of zeros in the critical strip: If N(T) denotes the number of zeros p = [3 + i-f in the 

critical strip with ordinates < 7 < T, then 

iV(T) = Jlog£-J + 0(lo g T) 

as T — > 00 . 

Since the zeros of £(s) are symmetric about the critical line, the simplest possible assumption is that they 
all lie on the line. This is the famous 

Riemann Hypothesis: If p = j3 + i-f is a nontrivial zero of the zeta-function, then (3 = 1/2. 

I will discuss the evidence for the truth of the Riemann Hypothesis later. First, however, I want to explain 
the most direct connection between the primes and zeros of the zeta-function. 

2 The Prime Number Theorem 
The Prime Number Theorem is the fundamental statistical fact about the primes. 

Prime Number Theorem. Let n(x) = J2 P < X 1- Then we have 

■k{x) <~ as x — > 00 . 

logx 

One interpretation of the theorem is that the probability that a positive integer chosen at random in the 
interval [1, x] is a prime equals 1/ log a;. Another is that the average distance between consecutive primes in 
the interval [l,x] is logx. 

For technical reasons, it is more convenient to express the theorem in the following form, which can be 
shown to be equivalent by partial summation. 

Prime Number Theorem (second version). Set 

logp if x = p k , 

if x ^ p k . 
Then we have 

ip{x) = ^ A(n) ~ x as x — > 00 . 

n<x 



A{x) = < 
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The proof I'll sketch here is based on the "explicit formula" , which is called that because it explicitly 
shows the relationship between the zeros and primes. 

We begin by assuming that 3?s > 1. From the Euler product representation for the zeta-function we see 
that 



Differentiating, we find that 



io g c( S )^-Eiog(i-p- s ) = EEfc37- 
p p fe=i 

4(«)=ei:^=e a(b) 



p k=l n 



Here we have used a consequence of the fact that ((s) has an Euler product, namely, that its logarithm and, 
therefore, its logarithmic derivative also have Dirichlet series representations. 

The idea now is to express the sum up to x of the coefficients A(n) of the last series (that is, ip(x)) as an 
integral transform. This is analogous to writing the Fourier coefficients of a periodic function as an integral. 

We break the argument into steps. 
Stepl. Note that 

i it»>i, 



2tT« h-ivo s 



1/2 if y = 1 , 
if < y < 1 . 

This is a standard exercise in complex function theory. If y > 1 we may pull the contour left to — oo. In 
doing so we pass a simple pole of the integrand with residue 1. If y < 1 we pull the contour right to +oo. 
This time we pass no poles, so the value of the integral is 0. When y = 1, we can calculate the Cauchy 
principal value of the integral directly, and it turns out to be 1/2. 
StepII. 

We use the formula above to evaluate 



27ri 7 2 _ ioo >v C S) ) s S ~2mJ 2 _ loc If- 



A(n) \ x s 
— as 



n" s 

71 = 2 / 



n=2 

The interchange of summation and integration is not quite justified here. We should really truncate the 
integral first and keep track of the error terms. But we will ignore this technical point so as not to obscure 
the main idea. 
StepIII. 

Evaluate the integral in Step II in a different way by pulling the contour left to — oo. We pick up residues from 
the simple poles of — ^(s)^- at i) the trivial and nontrivial zeros of ((s), ii) the pole of ((s) at s = 1, and 
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iii) the pole at s = 0. Calculating and summing the residues, and then equating the result to ip(x) — ^A(x) , 
we find that 




This is the "explicit formula" . Had we worked with a truncated integral over the interval [2 — iT, 2 + iT] , 
say, rather than the integral over [2 — zoo, 2 + too] (as we should have done to overcome the convergence 
problem in Step II), the sum over p's would also be truncated. The analysis is more complicated, but leads 
to a more useful form of the explicit formula, namely, 

i,{x)=x- —+£{x,T), 

| 7 |<T p 

where £{x, T) is a known error term. For the applications we have in mind, one can show that it is possible 
to choose T as a function of x in such a way that the error term is not significant. We therefore will not 
bother with the exact form of £(x,T). 

From the last form of the explicit formula one can almost see the Prime Number Theorem. Since \x p \ — 
the term involving the sum over zeros should be o(x) as long as the are not too close to 1. 
Indeed, using the estimate N(T) « TlogT, we see that the sum is 

<< ( max x p ) lor 1 << ( max x l3 )\og 2 T. 

0< 7 <T ' ^ 11 V 0< 7 <T ' 

0<7<T 

Now, one can show that the inequality < 1 — holds, where c is a positive constant, for every zero 
p = + ij. This leads to the Prime Number Theorem with an error term: 

ip(x) =x + (xe- b ^°^^j , 

with b a positive constant. Clearly the farther left the zeros all lie from the line 5is = 1, the better the error 
term. Since the zeros are symmetric about the line a = 1/2, the farthest left they can be is on the critical 
line, and in this case one can show that the sum over zeros in the explicit formula is 0(x 1 ' 2+e ). Thus the 
Riemann Hypothesis implies that 

iP{x) =x + 0(x^ 2+t ) . 

In fact, this statement also implies the Riemann Hypothesis. 

Why do we care about the error term? Because the main term just tells us the large scale behavior of 
the sequence of primes; all the detailed fluctuations in the counting function for the primes is hidden in the 
O-term. To illustrate this point, let us assume RH and consider the problem of how large the gaps between 
consecutive primes can be. From tp(x) = x + 0(x 1 / 2+e ) we easily see that 

ip(x + h) - il>(x) = h + 0(x 1/2+t ), l<h<x. 
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Now suppose that there are no primes in [x, x + h) (it can easily be shown that we may ignore the prime 
powers). Then = h + 0(.t 1 / 2+£ ), so we have h = 0(x 1 / 2+e ) . Thus, on RH there is a positive constant C 
such that the interval (a;, x + Ca; 1 / 2+e ] always contains a prime. Hence, the error term in the Prime Number 
Theorem has a bearing on the size of the maximal gap between primes. Had we not assumed RH, the same 
analysis would have only led to the assertion that every interval (x, x + Cxe~ b ^ l ° sx ] contains a prime. This 
of course is much weaker. 

3 The evidence for the Riemann Hypothesis 

I will conclude by indicating why we believe the Riemann Hypothesis. The main evidence supporting it 
is the following. 

(1) Zero— free regions: There is a region to the left of the line 5Rs = 1 that is free of zeros. More 
specifically, there is a positive constant c such that the region in the critical strip bounded by the 
curve a = 1 — c/log(|t| + 2) on the left, and a = 1 on the right, contains no zero of C( s )- (We 
used this fact when we deduced the Prime Number Theorem with error term.) The region has been 
widened slightly, but no one has been able to extend it to a vertical strip. The conjecture that there 
is such a strip is refered to as the Quasi-Riemann Hypothesis. 

(2) Zero— density estimates: Let N(<r, T) denote the number of zeros p = /3 + of the zeta-function 
such that a < /3 < 1 and < 7 < T. Many estimates have been proved of the type N(a, T) < T x ^ 
with A(er) < 1 and A(cr) decreasing for 1/2 < a < 1. 

(3) Calculations of zeros: The first fifty billion zeros of the zeta-function above the real axis have 
been shown to be simple and to lie on the critical line. Also, A. M. Odlyzko [O] has performed 
extensive computations showing, among many other things, that the nearest several hundred million 
zeros to the 10 20 th zero lie on the critical line. Zeros of many other L-functions have also been 
computed and all of these have been shown to lie on the (corresponding) critical line. 

(4) Estimates of zeros on the critical line: Let N (T) denote the number of zeros of ((s) on the 
critical line whose ordinates 7 satisfy < 7 < T . In 1914, Hardy [H] showed that N (T) — ► 00 with 
T. In 1921, Hardy and Littlcwood [HL2] showed that N (T) > T. Then, in 1942, A. Selberg [S] 
proved that Nq(T) > cN(T), for some positive constant c. Thus, a positive proportion of the zeros 
lie on the critical line. The constant c was quite small, but in 1974 N. Levinson [L], using a different 
method, showed that N n (T) > 1/3A(T). In 1989, B. Conrey [C], iincreased the proportion to more 
than 2/5. 

(5) The finite field case: It is possible to define analogues of the zeta-function for curves and varieties 
over finite fields. It has been shown that the analogous Riemann Hypotheses for these zeta-functions 
are true. 
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Lecture II 

Mean- Value Theorems and the Zeros: Three Applications 



1 AN INTRODUCTION TO MEAN- VALUE FORMULAS 

In this lecture I will explain what mean-value estimates are, give a sampling of some of the most important 
ones, and present three applications to the study of the zeros of the zeta-function. These should make it 
clear why they play such a central role in the theory. 

Let's begin with some general remarks on mean-value theorems. 

By a mean-value theorem we mean an estimate for an integral of the type 



Jo 



\F(<7 + it)\ 2 dt 



or 

i-T 



I 

JO 



F(a + it) dt 



as T — > oo, where F(s) is a function representable by a convergent Dirichlet series in some half-plane 3?s > a 
of the complex plane. The path of integration here need not lie in this half-plane. For example, we would 
like to know the size of the integrals 

I k (a,T) = [ T \((a + it)\ 2k dt, 
Jo 

for a > 1/2 and k a positive integer. Here F(s) — C(s) fe and its Dirichlet series converges only for a > 1. 

There are many variations on this theme. For example, one might also consider a discrete version, namely 
an estimate for a sum of the form 

R 



J2\F(o- r +lt r )f 



r=l 



where the points a r + it r lie in C. Another possibility is for F(s) to involve a parameter N, say. We then 
desire as uniform an estimate as possible in both N and T. The simplest case is when 



F( S ) = F N ( S ) = J2 



N 

71=1 
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is a Dirichlct polynomial. Here one can calculate the mean-value in a straightforward way. We have 



f \F N (<j + it)\ 2 dt= f |Va„n- ff 
Jo Jo n=1 

-T N N 
" , ° n =lro=l 



T N N 



m- a+lt dt 



N N _ r T 

VV^f / (m/nfdt 



T y> |a«| 2 a«a m An/m) lT - 1 

Z— / 7j2cr Z— / frj.m'l ' i lnfr n/m 

n=l Km,n<N 



(nm) <T i\ogn/m 



It is not difficult to show that the second term on the last line is 

i |2 



I |2 



n=l 

Hence, we find that 

r T N N 



f \Y^a n n-^dt = {T + 0{N\o g N))Y J ^ 

Ja n=l n=l 71 



From this we see that, as long as N <C T 1 6 for some small positive e, the asymptotic estimate 

r T n N 

/ iya»n-" f<ft = TV 
Jo . , 1 



n— 1 n— 1 



holds. On the other hand, when N ^> T the mean-value can be about as large as 

N I |2 



n=l 



Thus, the size of the mean-value is dominated by the contribution of "diagonal" terms when N is smaller 
than T but, in the opposite case, the main contribution may be from the "off-diagonal" terms. Goldston 
and Gonek [GG] have given a much more precise version of the mean-value formula for such "long" Dirichlct 
polynomials in terms of the size of the coefficient correlations sums X^=i a nO-n+h- 



2 Connections between zeros and mean- values 

Mean-value estimates are used in many ways to study the zeros of the zeta-function; indeed, this is one 
of the reasons that so much effort has been expended on them. Why should there be a connection? One 
direct link is the general relationship between the zeros of an analytic function and its average size as given 
by Jensen's Formula in classical function theory. 
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Jensen's Formula: Let f(z) be analytic for \z\ < R and suppose that /(0) ^ 0. If n, r 2 , . . . , r n are the moduli 
of all the zeros of f(z) inside \z\ < R, then 



Here we see that the size of the mean-value of log |/(z)|, this time around a circle, is related to the distribution 
of the zeros of f(z) inside that circle. There is an analogous result for rectangles, which is often more useful 
when working with Dirichlet series, namely, 

Littlewood's Lemma: Let /(s) be analytic and nonzero on the rectangle C with vertices coi ci) G \ +iT, and 
(To + iT, where <7o < o\ . Then 



where the sum runs over the zeros p of f(s) in C and "Dist( j o)" is the distance from p to the left edge of the 
rectangle. 

When we use Littlewood's Lemma below, it will turn out that only the first term on the right-hand side 
is significant. So in order to not get too technical, I will always use the result in the form 



where £ is an error term that can be ignored and may be different on different occassions. 
The Integral of the logarithm usually cannot be dealt with directly, so we often use the following trick. 
We have 



where the inequality follows from the arithmetic-geometric mean inequality. In this way we see a direct 
connection between the location of the zeros within a rectangle and the type of mean-values we have been 
considering. 






27r^Dist(p) 

pec 
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A SAMPLING OF MEAN-VALUE RESULTS 



A great deal of work has been devoted to estimating the means 




When k = 1 we know that for each fixed a > 1/2 



h(a,T) ~ c(a)T 
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as T — ► oo, where c(a) is a know function of a. In 1918 Hardy and Littlewood [HL1] proved that, on the 
critical line itself, 

Ii(l/2,T)~TlogT. 

What can such estimates tell us about the zeta-function? Comparing the result for a greater than 1/2 with 
that for a = 1/2, we see that the zeta-function tends to assume, on average, much larger values on the 
critical line than to the right of it. Since it also has many zeros on the critical line, we should expect the 
zeta-function to behave rather erratically there. 

The next higher moment was detrermined in 1926 by Ingham [II], who proved that 

/ 2 (l/2,T)~^log 4 T. 

Unfortunately, no asymptotic estimate for any k greater than 2 has ever been proven. Ramachandra [R] has 
shown that 

/ fe (l/2,T)»Tlog fe2 T, 

and we expect that 

/ fe (l/2,T)«Tlog fe2 T. 
Conrey and Ghosh [CGI] have conjectured that 

4(l/2,T)~ f ^ Ty Tlog fe2 T, 



where 



--nil'-;) E 



p 



— r 



r=0 

and gk is an unknown constant. Not only does a proof of the conjecture seem far off, but it is only recently 
that anyone been able to suggest a plausible value for gk- I will return to the problem of gk in the final 
lecture. 

Another type of mean-value important in applications is 



where 



and P(x) is a polynomial. Since 



ICO + it)M N (a + it)\ 2 dt , 



l<n<N ^ ^ N 



n=l 

we can view M^(s) as an approximation to the reciprocal of ((s) in 3?s > 1. We might then expect the 
approximation to hold (in some sense) inside the critical strip as well. If that is the case, we should also 
expect that multiplying the zeta-function by Mm dampens, or mollifies, the large values of zeta. Below we 
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will see two applications of this idea. The most general estimates known for such integrals are due to Conrey, 
Ghosh, and Gonek [CGG2], who obtained asymptotic estimates for them when the length of the Dirichlet 
polynomial M N (s) is N — T e with 9 < 1/2. Later, Conrey [C] used Kloosterman sum techniques to show 
that these formulas also hold for 9 < 4/7. 

Assuming the Riemann Hypothesis and the Generalized Lindcloff Hypothesis are true, Conrey, Ghosh, 
and Gonek [CGG2] also proved discrete versions of such mollified mean-values, including estimates for sums 
of the type 

J2 \('(p)M N (p)\\ 

0< 7 <T 

where 7 runs over the ordinates of the zeros p = (3 + i-f of ((s). The first result of this type, but without 
the polynomial Mjy, were proved by Gonek[G] under the assumption of the Riemann Hypothesis alone. 
Having presented a brief catalogue of mean-value estimates, I will now turn to a few of their applications. 

4 A SIMPLE ZERO-DENSITY ESTIMATE 

We want to show that there are relatively few zeros of the zeta-function in the right half of the critical 
strip. Let cto be a fixed real number strictly between 1/2 and 1 and let C be the rectangle in the complex 
plane with vertices at 2, 2 + iT, <r + iT, (Jo- Applying our (simplified) version of Littlcwood's Lemma, we 
see that 

VDist(p) = -^ / log(\((<T +it)\)dt + £, 

Where Dist(/o) is the distance of the zero p from the line 3?s = <7o- Now let a be a fixed real number with 
(To < cr < 1 and write N(a, T) for the number of zeros p — (3 + i-f of ((s) with a < (3 < 2 and < 7 < T. 
On the one hand, we have 

^Dist(p) > Dist(p) > (cr - a )N{(T,T). 

pec P ec 

On the other hand, 

i- £ log(|C(a + it) |) dt = ^[ log(|CK + it)\ 2 ) dt 

-£ log( ^/ T|C(,To + ^ |2) * 

by the arithmetic-geometric mean inequality, as before. The integral on the last line is Ik((To,T), which we 
have seen is ~ c(<t )T, where c(a ) is positive and independent of T. Thus, the last expression is 0(T) . It 
follows that 

N(<i,T) <C T . 

Since N(T) - £ logT, we see that 

N(a,T)/N(T) = 0( ] ± f ) 
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for any fixed a > 1/2. We may interpret this as saying that the proportion of zeros to the right of any line 
3?s = a > 1/2 is infinitesimal. 

This, the first zero-density estimate, was proved by H. Bohr and E. Landau [BL] in 1914. Since then 
much stronger results have been proven, typically of the form 



where A(er) < 1 for a > 1/2. Nevertheless, the underlying idea in the proof of many (but not all) of these 
results already appears here. 

5 Levinson's method 

Zero-density theorems tell us there are (relatively) few zeros to the right of the critical line. Our goal 
here is to sketch the metod of Levinson [L], which shows that there are many zeros on it. 
Recall that 



denote the number of zeros on the critical line up to height T. The important estimations of N (T) were: 

G. H. Hardy (1914) : N Q (T) -> oo (as T -> oo) 

G. H. Hardy-J. E. Littlewood (1921) : N (T) > cT 

A. Selberg (1942) : N (T) > c'N{T) 

N. Levinson (1974) : N Q (T) > |iV(T) 

J. B. Conrey (1989) : N (T) > fiV(T) 

In keeping with the theme of this lecture, I should point out that each of the last four results requires the 
use of mean- value theorems. 

Levinson's method begins with the following fact first proved by Speiser [Sp]. 
Theorem (Speiser). The Riemann Hypothesis is equivalent to the assertion that C( s ) does not vanish in 
the left half of the critical strip. 

In the early seventies, N. Levinson and H. L. Montgomery [LM] proved a quantitative version of this. Let 



N(a,T) « T Ma) , 



N(T) = #{p = /3 + «7 I C(P) - 0, < 7 < T} 



T 



~-logT 



and let 




N'_(T) = - | CV) = 0, -K < 1/2, < V < T} 



and 



7V_ (T) = #{/? = (3 + i 1 | ((p) =0, -K/3 < 1/2, < 7 < T} • 
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Theorem (Levinson-Montgomery). We have 7V_(T) = N'_(T) + O(logT) . 

The idea behind the proof is as follows. Let < a < 1/2 and let C denote the positively oriented rectangle 
with vertices a + iT/2, a + iT, —1 + iT, and —1 + iT/2 . By a standard method it is not difficult to show 
that 



C 

A arg — (s) 



= 0(logT), 

c 



independently of a. Given this, we see that 

2tt(# zeros of C'(s) inC - # zeros of ((s) inC) = O(logT). 

The theorem now follows on observing that a was arbitrary and by "adding" rectangles with top and bottom 
edges, respectively, at T and T/2, T/2 and T/4, .... 

We now sketch Levinson's method. We have just seen that 7V_(T) = N'_(T) + O(logT). Now, the 
nontrivial zeros of £(s) are symmetric about the critical line. Hence, the number of them lying to the right of 
the critical line, to the left of the line a = 2, and above the real axis up to height T is also N_(T). Therefore 

N(T) = N (T) + 2N_(T) 

= N (T) + 2N'_(T) + O{logT), 

or 

N (T) = N(T) - 2N'_(T) + O(logT). 

The size of the first term on the left hand side of the last line is known, namely, 
(1 + o{l))j- logT. Hence, if we can determine a sufficiently small upper bound for N'_(T), we can deduce a 
lower bound for No(T). 

To find such an upper bound it is convenient to first note that the zeros of C'( s ) m the region — 1 < a < 
1/2, < t < T, are identical to the zeros of £'(1 - s) in the reflected region 1/2 < a < 2, < t < T . One 
can also show, by the functional equation of the zeta-function, that C'(l — s) and G(s) — ((s) + ('(s)/L(s), 
where L(s) is essentially -MogT, have the same zeros in 1/2 < a < 2, < t < T . It turns out to be 
technically advantageous to count the zeros of G(s) rather than those of £'(1. — s). 

To bound the number of zeros of G(s) in this region, we apply Littlewood's Lemma. Let a = \ — r^fr' 
with S a small positive number, and let lZ a denote the rectangle whose vertices are at a, 2, 2 + iT, and a + iT. 
It would be natural to apply our abreviated form of the lemma to obtain 



J2 Dist G°*) = ^/ \og\G{a + it)\dt + £, 



P*en a 
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where p* denotes a zero of G(s) and Dist(p*) is its distance to the left edge of lZ a . However, in the next 
step, when we apply the arithmetic-geometric mean inequality to the integral, we would lose too much. To 
avoid this loss, we first dampen, or mollify, G(s) and apply Littlewood's Lemma in the form 



1 f T 

V Dist(p**) = — / log \G (a + it) M {a + it)\dt + £ . 

r~l 2lT Jo 



gm{ p ")=o 



Here 



i<T e 

approximates 1/C( S ) and 9 > 0. Note that included among the zeros of G(s)M(s) in lZ a arc all the zeros of 
G(s) in lZ a . Therefore we have 

£ Distant £ Dist( P *) 

GM(p")=0 G(p*)=0 

> Dist ^*) 

p*eR a ,»p'>i/2 

G(p*)=0 

>^-a)N'_(T). 



We now see that 

(1/2 — a)N'(T) < / Iog|GM(a + it)|(ft + f 
= -3- / log |GM (a + it)\ 2 dt + £ 
< J log ^ £ \GM(a + it)\ 2 dt^j + £ . 

Thus, we require an estimate for 

/ \GM(a + it)\ 2 dt. 
Jo 

This is similar to a mean-value we saw in Section 3. Levinson was able prove an asymptotic estimate for 
this integral when 9 = 1/2 — e with e arbitrarily small. The resulting upper bound for N'_{T) then led to 
the lower bound 

N (T)> Q+o(l)) N(T). 
Much later, Conrey was able to establish an asymptotic estimate when 6 = 4/7 — e, which led to 



N (T)> (J N(T). 



The form of the asymptotic estimate in both cases is the same as a function of 6, and D. Farmer [F] has 
given various heuristic arguments that suggest it should remain true even when one takes 9 arbitrarily large. 
From Farmer's conjecture it follows that 
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N (T)~N(T) . 

Before concluding this section, we remark that had we introduced a mollificr into our proof of the Bohr- 
Landau result in the previous section, we would have obtained a much stronger zero-density estimate. 

6 The number of simple zeros 

Our final application demonstrates the use of discrete mean-value theorems. 
Let 

N S (T) = #{p = P + h\ C(p) = 0, C(p) ± 0, < 7 < T} 

denote the number of simple zeros of the zeta-function in the critical strip with ordinates between and 
T. It is believed that all the nontrivial zeros are on the critical line and simple, in other words, that 
N(T) = N (T) = N S (T) for every T > 0. In 1973, H. Montgomery [Mo], used his pair correlation method 
to show that if the Riemann Hypothesis is true, then at least 2/3 of the zeros are simple. In other words, 

N S {T)/N{T) > 2/3 

provided that T is sufficiently large. We will present his argument in the third lecture. Now, however, 
we briefly describe a different method of Conrey, Ghosh, and Gonek [CGG1], which shows that on the 
stronger hypotheses of RH and the Generalized Lindeloff Hypothesis, one can replace the 2/3 above by 
19/27= .703.... 

By the Cauchy-Schwarz inequality, we have 

]T C'(1/2 + *7)MaK1/2 + h)| 2 < ( £ X )( E K \ P )M N {p)\ 2 ) , 

0<7<T 0<7<T 0<7<T 

1/2+17 is simple 

where Mjy(s) is a Dirichlct polynomial of length N with coefficients similar, but not identical, to those of 
M(s) in the last section. Its purpose is also similar: to mollify ( (1/2 + i-f) so as to minimize the loss 
in applyng the Cauchy-Schwarz inequality. If one assumes RH, the sum on the left-hand side is easy to 
compute and turns out to be ~ i|iV(T)logT. The sum on the right-hand side is much more difficult to 
treat, but one can show that if RH and GLH are true, then it is <~ ||7V(T)log 2 T. Inserting these estimates 
into the inequality above and solving for N S (T) leads to the result stated. An elaboration of the method 
leads to the conclusion that, on the same hypotheses, at least 95.5% of the zeros of ((s) are either simple or 
double. 
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Lecture III 



Beyond the Riemann Hypothesis 



1 



Gaps between primes again 



In the first lecture we saw that the Prime Number Theorem with error term implies that if 4>(x+h) — ip( x ) = 
0, then there is a positive constant c\ such that h -C xe~ Cl ^ l ° sx . We also saw that if the Riemann Hypothesis 
is true, then h <C x 1 / 2+e for any positive e. The prime powers higher than the first contribute at most 0(x 1 ^ 2 ) 
to il>(x + h) — ip(x), so another way to phrase this is that the size of the gap between any two consecutive 
primes p and p' is 0(pe~ Cl ^ losp ) unconditionally, and 0(p 1//2+£ ) on RH. On the other hand, the Prime 
Number Theorem tells us that the size of the average gap between p and p' is <~ log p. This suggests that if 
the primes behave "randomly" , then p'-j)<j) e , and the numerical evidence does indeed support this. 

Here we have a problem for which even the assumption of the Riemann Hypothesis does not seem to give 
the right answer. The question I want to begin with here is: Why? 

The answer is not difficult to find. Consider again the explicit formula 



where £(x, T) is a known error term and, from now on, we assume the Riemann Hypothesis. If we apply the 
formula with the arguments x + h and x and subtract, we obtain 



There is likely to be a lot of cancellation in the sum in the integrand. However, when we estimated the error 
term in the Prime Number Theorem, we lost it all by putting absolute values around the individual terms. 
Clearly this cancellation depends completely on the distribution of the sequence of ordinates 7. In other 
words, on the vertical distribution of the zeros of the zeta-function. 

This example is not unique; it often happens that the strength of the Riemann Hypothesis, or even of 
the Generalized Riemann Hypothesis, is not sufficient to establish what we think is the ultimate truth in 
important arithmetical questions. We also often find that we need to understand the vertical distribution of 
the zeros of the zeta-function and L-functions. 



tjj(x) = x — 



\l\<T 




h\<T 
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2 Pair correlation 

Prior to the early seventies, such an understanding seemed beyond reach. Then, in 1973, Hugh Mont- 
gomery [Mo] found a way to study the distribution of the differences between all pairs of ordinates of zeros 
of the Riemann zeta-function, assuming RH is true. 

Montgomery's Theorem: Assume the Riemann Hypothesis. Set w(u) = 4 ^ ui , and for a real and T > 2 
write 



F(a) = F(a,T) = (— logTT 1 ]T u;( 7 - V) T ia ^~^ 



0<7,7'<T 

where 7 and 7' run over ordinates of zeros of the Riemann zeta-function. Then F(a) is real and an even 
function of a. Moreover, for any e > we have 

F(a) = (1 + o(l))T- 2a log T + a + o(l) (asT -» 00) 

uniformly for < a < 1 — e. 

It was later observed that F(a) is nonnegative. 
Integrating F(a) against a kernel f (a), we see that 

(— logT) / F(a)f(a)da= / ^ 10(7 - 7') T ia ^-^r(a) da 

J -co 0<7i7 /< T 

/oo 
r*«(7-7')f( a ) da 

= r ((7-7')^-M7-7'), 

0<7,7'<T 

where r is the inverse Fourier transform of f, that is, 

/oo 
f(a)e 2mau da . 
-00 

Thus, the integral of F(a) against a kernel f produces a sum involving the inverse transform r evaluated at 
the differences of pairs of ordinates. Since Montgomery's Theorem is only valid in the range — 1 < a < 1, one 
can only use kernels r{a) supported on (—1, 1). For example, assuming RH and taking f(a) = max{0, /3 _1 (1 — 
|a//3|)} with < /3 < 1, one obtains 

y sin((/V2)(7 - 7') logT) 2 _ 1 T 

0<7 ^< T l (/?/2)(7-V)logT ' 17 7j l /3 3 J 2rr g 

Montgomery used this to obtain a lower bound for the number of simple zeros of the zeta-function as 

follows. First observe that 

y i< y ( ^((/?/2)(7-y)iogT) 2 

0< 7 ^<T "0<7^T (^)(7-70logT ^ ^ ^ 
7=7' 
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Taking (3 = 1 — e in this, we obtain 

E l<(|+o(l))£logT. 

0<7,7'<T 
7=7' 

Now, if the zero \ + ij has multiplicity m(-f), then each 7 occurs 771(7) times in the sum on the left. Thus, 
we have 

E 1= E "»w 

0<7, 7 '<T 0<7<T 
7=7' 

and therefore 

E ^(7) < (l +0 (l))^logT. 

0<7<T 

Finally, we easily see that 

]T 1> E (2-™ 7 )>(2-^+ (l))^logT. 

0<7<T 0<7<T 

§+47 is simple 

Hence, if the Riemann Hypothesis is true, then at least two-thirds of the zeros are simple. Although this is 
not quite as strong a result as that obtained in Lecture II, namely (19/27 + o(l))^ logT, the hyotheses are 
also not as strong. For there we needed to assume the Generalized Lindeloff Hypothesis in addition to RH. 

Since we have focused so much on mean-value theorems, I should point out that Montgomery proved his 
theorem by relating F(a) to the mean-value of a Dirichlet series, namely, 

- / T I E ^{n){-r 1/2+U + E A(n)(-) 3 / 2 +*T dt , 

u n<x n>x 

where x — T a . Here we see a different explicit connection between the zeros and the primes. Indeed, Mont- 
gomery's starting point was a generalization of the explicit formula we saw in Lecture I (and again at the 
beginning of this lecture). The restriction a < 1 in Montgomery's Theorem arises for a familiar reason: when 
a > 1, the off-diagonal terms in the integral above contribute to the main term in the mean-value estimate. 
To determine this contribution (heuristically) , Montgomery used a strong form of the Hardy-Littlcwood 
twin prime conjecture. In this way he arrived at 

Montgomery's Conjecture: We have 

F(a,T) = (l + o(l)) (asT^oo) 

for a > 1, uniformly in bounded intervals. 

This together with Montgomery's theorem determines F(a) on all of R. Thus, one may use the conjec- 
ture to integrate F(a) against a much wider class of kernels than just those supported in (—1, 1). Using an 
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appropriate kernel he arrived at the 



Pair Correlation Conjecture: For any fixed a and (3 with a < (3, we have 



Y,V<T V a V 



\ 2 ■ 

sin7ra; \ , „, „. \ T 



0<t,l'<T 
2tt« / log T<7' — r<2ir/3/ log T 



dx + <5(a,/?) — logT 



TTX J / 2tT 



as T tends to infinity, where S(a, (3) = 1 if <G [a, /?], and <5(a, (3) = otherwise. 



The Pair Correlation Conjecture is an assertion about the distribution of the set of all differences between 
pairs of ordinates of the zeros. An enormous amount of data concerning the zeros has been collected and 
analyzed by A. M. Odlyzko [O], and the fit with the conjecture is remarkable. 

As an example of the type of information we can deduce from it, let < a < (3 with a arbitrarily small. 
Then we find that 



7, 7 < 1 \ 



0<7,7 <T 
0<7'-7<27r/3/logT 



/3 1 /sinrnzA 2 \ T m 
l - dx —logT. 



This shows that an infinite number of the zeros have another zero no farther away than 2tt(3/ logT, no matter 
how small (3 is. We also deduce that 



:7,7 <t \ 



/sinrrxX 2 , \ T 

l - + l — logT . 



-/3 

Combining this with the previous formula, we obtain 



0<7,7 <T 
-2ttP/ log T<7'-7<2?r/3/ logT 



TTX J 2n 



0<7, 7'<T 



7 =7 

By our earlier discussion, we may write this as 



J2 Ml)-^ogT. 



0<7<T 

On the other hand, von Mangoldt's formula tells us that 



0<7<T 

It thrcforc follows that 



0<7<T 

^+»7 is simple 



In other words, almost all the zeros are simple. 

Before moving on we mention that D. Goldston and H. Montgomery [GM] have shown that the Pair 
Correlation Conjecture is equivalent to a certain estimate of the variance of the number of primes numbers 



THREE LECTURES ON THE RIEMANN ZETA-FUNCTION 21 

in short intervals. D. Goldston, S. Gonck, and H. Montgomery [GGM] have shown that it is also equivalent 
to an estimate for the mean-value 



T ft 2 







[a + it) 



dt, 



for a near 1/2. Estimates of F(a, T) when a > 1 remain elusive. The only progress in this direction so far is 
the lower bound F{a, T) > 3/2 — a + o(l) on the interval (1, 3/2) under the assumption of the Generalized 
Riemannn Hypothesis. This is due to D. Goldston, S. Gonek, A. E. Ozliik, and C. Snyder [GGOS]. 

3 Random matrix theory 

Shortly after completing the work described above, Montgomery was told by F. Dyson that the "form 
factor" 1 — (— ) 2 m the distribution law he had conjectured for pairs of zeros of the zcta function is the 
same one that holds for pairs of eigenvalues of large random Hermitian matrices from the Gaussian Unitary 
Ensemble, or GUE, which we describe below. This and other matrix ensembles had been studied by physi- 
cists for decades because they can be used to model the Hamiltonians of complicated physical systems. The 
spectra, or energy levels, of such systems are given by the eigenvalues of the corresponding Hamiltonian. But 
in complicated situations, the Hamiltonian, let alone its eigenvalues, may not be known with any certainty. 
In such cases the Hamiltonian can be modeled by large random Hermitian matrices with symmetry properties 
dictated by the physical situation. It is found that the average behavior of the eigenvalues of such families of 
matrices is often in agreement with the experimental data. Physicists are particularly interested in knowing 
various statistics of the energy levels, and pair correlation is merely one of these. They had also worked out 
"n-level" correlations of the eigenvalues, and Montgomery conjectured that the analogous law (there is a nor- 
malization one has to take into account) holds for the "n-level" correlations of the zeros. Specifically, we have 



Montgomery's GUE Hypothesis: The distribution of all (n — l)-tuples (72 — 71,73 — 71, . . . ,7„ — 71), 
with the 7i ordinates of the zeros, has the form factor det K(x\, . . . , x n ), where 

vi \ - (v. \n h _ -1 t sinTT^ - xj) 

ir{Xi-Xj) 

The Pair Correlation Conjecture is the n = 2 case. Odlyzko [O] also used his data (alluded to above) to 
check this prediction, and the evidence is again compelling. Moreover, so is the theoretical support (see, for 
example, E. Bogomolny and J. Keating [BK], D. Hejhal [He], and Z. Rudnik and P. Sarnak [RS]). 

Finally, the Gaussian Unitary Ensemble of order N is the set of all N x N Hermitian matrices H = 
{Hj t k)i<j,k<N made into a probability space by equipping it with a probability measure p(H)dH, invariant 
under conjugation by all N x N Unitary matrices, where 

j<k j<k 
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and 



p(JT) = TT J- e - H h TT l e -2mH 3 , k f + (^ k r) 

J - J - \TK - LJ - 7T 

n AT V n ^ t. 



In practice it is often easier to work with the so called Circular Unitary Ensemble, or CUE, rather than 
the CUE. This is the compact group of N x N unitary matrices equipped with Haar measure normalized so 
that the measure of the group is 1. All eigenvalues have modulus one and the statistics of the eigenanglcs 
are known to be the same as those for the GUE eigenvalues. 

4 Applications of random matrix theory to the zeta-function. 

Another remarkable development in the application of random matrix theory to analytic number theory 
has been the discovery by J. Keating and N. Snaith [KS] that the characteristic polynomial of a large random 
matrix from the Gaussian Unitary Ensemble or Circular Unitary Ensemble can be used to model the Riemann 
zeta-function and other L-functions. 

The idea is as follows. Since Riemann's function £(s) is entire, it has a Hadamard product representation. 
Moreover, ((s) and £(s) are the same up to well understood multiplicative factors. Therefore, one might 
plausibly assume that at a large height t in the critical strip, ((s) (with s = a + it) should behave like a 
polynomial with the same zeros near t. If the zeros are distributed like the eigenanglcs of matrices from the 
Circular Unitary Ensemble, one might then expect 

N 

Z N (U,9) = l[(l-e^- e '>), 

n=l 

where the 9 n are the eigenangles of a random N x N unitary matrix U from CUE, to model C(l/2 + it). For 
scaling reasons one takes N = logt. 

Keating and Snaith conjecture that the average of \Zn(U, 6)\ 2k over the full Circular Unitary Ensemble, 
with respect to Haar measure on the group, should be directly related to the 2fcth moment 



T 1 



4(1/2, T) = \C(- +it )\™dt 

of the zeta-function. Similarly, the distribution of values of \ogZx{U, 6), say, should be the same as that of 
logC(| + it) . The agreement with known results in both cases is remarkable. 

Consider the case of Ik- Recall from Lecture II that it had long been conjectured that there is a constant 
Ck such that 

/ fe (l/2,T)~c fc Tlog fe2 T 

as T — > oo. J. B. Conrey and A. Ghosh [CG] have recast the conjecture into a more precise form, namely 
that 

Cfe = 3feafe/r(/c 2 + 1) , 
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where 



iV*- 1 ) 2 ^ a -in 2 



» / z — ' \ r 

• p/ r=0 V 



and <7fe is an integer. Thus the question comes down to the value of g^. The only proven values are the 
classical ones due to Hardy and Littlewood [HL1] and Ingham [II] of g\ = 1 and 172 = 2 , respectively. 
Conrey and Ghosh [CG2] conjectured that gz = 42 and, using long Dirichlet polynomials to approximate 
C(s) k , Conrey and Gonek [CGo] conjectured that g^ = 24024. At about the same time, Keating and Snaith 
[KS] calculated arbitrary complex moments of the characteristic polynomials Zn(U,6) averaged over all 
N x N matrices U in the CUE, and when k = 1, 2, 3, and 4 they obtained the same values for the numbers 
corresponding to gk as those above. They argued that one could therefore model the moments 7fc(l/2, T) by 
the average of \Zn(U, 9)\ 2k over CUE and conjectured that 

Interestingly, the Keating-Snaith and Conrey-Gonek conjectures were first publicly announced at the Rie- 
mann Hypothesis Conference in Vienna, just moments after it was checked that the Keating-Snaith conjec- 
ture in fact predicts that 174 = 24024 . 

The characteristic polynomial model has proven to be extremely powerful for predicting other behavior 
of the zeta-function and L-functions that once seemed hopelessly beyond reach. In fact, to a large extent it 
has been responsible for an explosion of activity in the field and of collaboration between number theorists 
and theoretical physicists. 

Impressive as the characteristic polynomial model has proven to be, it has the obvious drawback that it 
contains no arithmetical information. The prime numbers do not appear in this model of the zeta-function! 
In the moment problem, this is reflected by the absence of the arithmetical factor in the Keating-Snaith 
conjecture. They had to insert it in an ad hoc way. Fortunately, in the moment problem, it was only the 
factor gk and not ctk that proven elusive. A precise and more satisfactory model for the zeta-function (and 
other L-functions) clearly has to include such relevant information. 

In work in progress with J. Keating and C. Hughes, we have now succeeded in finding such a model for 
Q(s) and it can easily be generalized to model any L-function. I will conclude this lecture by describing the 
new model. 

Roughly, we have proven that if the Riemann Hypothesis is true, then for t e M and x > 1 we have 
C(l+^)=ex P [ J2 A(n)/(nW*logn) J flexp (^((i - 7 „) log*)) 

\2<n<x J n 

(times an error term that is essentially 1), where E\ (z) — 2— dw is the exponential integral. I say 
"roughly" because one also has to include smooth weights in the various factors. A similar formula holds 
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throughout the critical strip. Since we expect the ordinates j n of the zeros to behave like the eigenanglcs 
9 n of N x N random matrices in CUE, and "scaling" suggests that we take N to be the nearest integer to 
logi, we take as our model for zeta 

exp I Y, Hn}/(n 1/2+it logra) J J] exp (^((0 - 6 n ) logs)) . 

\2<n<x J n<N 

The presence of the exponential integral makes it a little complicated to compare this with the previous 
model, 

N 
n=l 

We note, however,that if 8 n is not too near 9, then the new model looks approximately like 

JJ (i-p-U/a-Ht))- 1 TJ exp (i _ x W-o n )j 

p<x n<N 

Here we clearly see both the primes and the zeros, and how the parameter x serves to connect them. The 
moments 7^(1/2, T) should now be given by the product of two moments- one being the 2fcth power of the 
modulus of the product over primes integrated with respect to t, the other being the 2fcth power of the 
modulus of the product over the eigenangles averaged over the Circular Unitary Ensemble. We call the 
conjecture that the mean can be computed this way, that is, as a product of two different types of means, 
the "Splitting Conjecture". 

The new model seems promising for many other investigations as well. To give just one example, we hope 
to use it to understand the horizontal distribution of the zeros of Q'{s) in the right half of the critical strip, 
a problem that has long defied us. We also expect it to give us more insight into the connection between 
primes and zeros. If we are extremely lucky, perhaps we will even find explicit and useful connections between 
primes in special sequences, such as twin primes or primes of the form n 2 + 1, and the zeros. 
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